





<110> APPLICANT: Bristol-Myers Squibb Company
      Chiang, Shu-Jen
      Jonathan, Basch
<120> TITLE OF INVENTION: DIRECT PRODUCTION OF DESACETYLCEPHALOSPORIN C
<130> FILE REFERENCE: ON0163
<140> CURRENT APPLICATION NUMBER: 09/801,852A
<141> CURRENT FILING DATE: 2001-03-08
<150> PRIOR APPLICATION NUMBER: 60/188,033
<151> PRIOR FILING DATE: 2000-03-09
<160> NUMBER OF SEQ ID NOS: 15
<170> SOFTWARE: PatentIn version 3.0

<210> SEQ ID NO: 1
<211> LENGTH: 1716
<212> TYPE: DNA
<213> ORGANISM: Rhodosporidium toruloides
<400> SEQUENCE: 1

atgctcctta acctcttcac cctcgcctcc ctcgctgcga cgctccagct cgcctttgcc      60
tctccgacct ccctcgtccg ccgcacgaac ccaaacgagc cccctcccgt cgtcgacctc     120
ggctacgccc gctaccaagg ctacttgaac gagaccgccg gactctactg gtggcgcgga     180
atccgctacg cctcggctca gcgcttccag gctcctcaga cgcccgcgac gcacaaggcc     240
gtccgcaacg cgactgagta tggaccgatc tgttggccgg ctagcgaggg aaccaacacg     300
accaagggct tgccgccgcc tagcaacagc tcgagcagcg cgccgcagaa acaggcgtcg     360
gaggattgcc tcttcctcaa tgtcgttgcc cccgccggct cgtgcgaggg cgacaatctt     420
cccgtcctcg tctacattca cggaggtggc tacgccttcg gcgatgcgag caccggcagc     480
gactttgccg ccttcaccaa gcacacggga accaagatgg tcgttgtaaa tctccagtac     540
cgtctcggca gctttggttt cctcgctggc caagccatga aggactacgg tgtaacgaac     600
gccggcttgc ttgaccagca attcgccctt caatgggttc aacagcacgt ctcgaagttc     660
ggcggcaacc ccgatcacgt tacgatttgg ggcgagtctg caggcgcagg gtccgttatg     720
aaccagatca ttgcgaacgg cggcaacacc gtcaaggctc tcggtctcaa gaagcccctc     780
ttccacgctg ccatcggctc ctccgtcttc ctcccctacc aagccaagta caactccccc     840
ttcgccgagc tgctctactc ccaactcgtc tcggcgacaa actgcaccaa agccgcctcg     900
tccttcgctt gcctcgaagc tgtcgacgct gcggcgctcg ctgcggcggg cgtgaagaac     960
tcggcggcgt tcccgttcgg gttttggtcg tatgtcccgg tcgtcgacgg gaccttcttg    1020
actgagcgcg cgtcgctcct tctcgccaag ggcaagaaga acctcaatgg caacctcttc    1080
accgggatca acaacctcga cgaaggattc atattcactg acgccactat tcagaacgac    1140
acgatcagcg accagtcgca gcgcgtctcc cagttcgacc gcctcctcgc cggcctcttc    1200
ccctacatca cctcggagga gcgccaggcc gtcgcgaagc agtacccgat ctccgacgcg    1260
ccgtcaaagg gcaacacctt ctctcgcatc tcggccgtca tcgcggactc gaccttcgtc    1320
tgcccgacct actggaccgc cgaggcgttc ggctcgtccg cccacaaggg cctcttcgac    1380
tacgcgccgg ctcaccacgc gaccgacaac tcgtactaca tcggctccat ctggaacggc    1440
aagaagtcgg tctcgtccgt ccagtccttc gacggcgcgc tcggcggctt catcgagacg    1500
ttcaacccga acaacaacgc tgccaacaag accatcaacc cttactggcc gacgttcgac    1560
tcgggcaagc agctcctctt caacacgacg acgagggaca ccctctctcc cgccgacccg    1620
cgcatcgttg agacttcaag cttgaccgac tttggcacga gccagaagac caagtgcgac    1680
ttctggcgtg ggtcaatctc ggtgaacgcg ggtctc                              1716

<210> SEQ ID NO: 2
<211> LENGTH: 572
<212> TYPE: PRT
<213> ORGANISM: Rhodosporidium toruloides
<400> SEQUENCE: 2

Met Leu Leu Asn Leu Phe Thr Leu Ala Ser Leu Ala Ala Thr Leu Gln
1               5                   10                  15

Leu Ala Phe Ala Ser Pro Thr Ser Leu Val Arg Arg Thr Asn Pro Asn
            20                  25                  30

Glu Pro Pro Pro Val Val Asp Leu Gly Tyr Ala Arg Tyr Gln Gly Tyr
        35                  40                  45

Leu Asn Glu Thr Ala Gly Leu Tyr Trp Trp Arg Gly Ile Arg Tyr Ala
    50                  55                  60

Ser Ala Gln Arg Phe Gln Ala Pro Gln Thr Pro Ala Thr His Lys Ala
65                  70                  75                  80

Val Arg Asn Ala Thr Glu Tyr Gly Pro Ile Cys Trp Pro Ala Ser Glu
                85                  90                  95

Gly Thr Asn Thr Thr Lys Gly Leu Pro Pro Pro Ser Asn Ser Ser Ser
            100                 105                 110

Ser Ala Pro Gln Lys Gln Ala Ser Glu Asp Cys Leu Phe Leu Asn Val
        115                 120                 125

Val Ala Pro Ala Gly Ser Cys Glu Gly Asp Asn Leu Pro Val Leu Val
    130                 135                 140

Tyr Ile His Gly Gly Gly Tyr Ala Phe Gly Asp Ala Ser Thr Gly Ser
145                 150                 155                 160

Asp Phe Ala Ala Phe Thr Lys His Thr Gly Thr Lys Met Val Val Val
                165                 170                 175

Asn Leu Gln Tyr Arg Leu Gly Ser Phe Gly Phe Leu Ala Gly Gln Ala
            180                 185                 190

Met Lys Asp Tyr Gly Val Thr Asn Ala Gly Leu Leu Asp Gln Gln Phe
        195                 200                 205

Ala Leu Gln Trp Val Gln Gln His Val Ser Lys Phe Gly Gly Asn Pro
    210                 215                 220

Asp His Val Thr Ile Trp Gly Glu Ser Ala Gly Ala Gly Ser Val Met
225                 230                 235                 240

Asn Gln Ile Ile Ala Asn Gly Gly Asn Thr Val Lys Ala Leu Gly Leu
                245                 250                 255

Lys Lys Pro Leu Phe His Ala Ala Ile Gly Ser Ser Val Phe Leu Pro
            260                 265                 270

Tyr Gln Ala Lys Tyr Asn Ser Pro Phe Ala Glu Leu Leu Tyr Ser Gln
        275                 280                 285

Leu Val Ser Ala Thr Asn Cys Thr Lys Ala Ala Ser Ser Phe Ala Cys
    290                 295                 300

Leu Glu Ala Val Asp Ala Ala Ala Leu Ala Ala Ala Gly Val Lys Asn
305                 310                 315                 320

Ser Ala Ala Phe Pro Phe Gly Phe Trp Ser Tyr Val Pro Val Val Asp
                325                 330                 335

Gly Thr Phe Leu Thr Glu Arg Ala Ser Leu Leu Leu Ala Lys Gly Lys
            340                 345                 350

Lys Asn Leu Asn Gly Asn Leu Phe Thr Gly Ile Asn Asn Leu Asp Glu
        355                 360                 365

Gly Phe Ile Phe Thr Asp Ala Thr Ile Gln Asn Asp Thr Ile Ser Asp
    370                 375                 380

Gln Ser Gln Arg Val Ser Gln Phe Asp Arg Leu Leu Ala Gly Leu Phe
385                 390                 395                 400

Pro Tyr Ile Thr Ser Glu Glu Arg Gln Ala Val Ala Lys Gln Tyr Pro
                405                 410                 415

Ile Ser Asp Ala Pro Ser Lys Gly Asn Thr Phe Ser Arg Ile Ser Ala
            420                 425                 430

Val Ile Ala Asp Ser Thr Phe Val Cys Pro Thr Tyr Trp Thr Ala Glu
        435                 440                 445

Ala Phe Gly Ser Ser Ala His Lys Gly Leu Phe Asp Tyr Ala Pro Ala
    450                 455                 460

His His Ala Thr Asp Asn Ser Tyr Tyr Ile Gly Ser Ile Trp Asn Gly
465                 470                 475                 480

Lys Lys Ser Val Ser Ser Val Gln Ser Phe Asp Gly Ala Leu Gly Gly
                485                 490                 495

Phe Ile Glu Thr Phe Asn Pro Asn Asn Asn Ala Ala Asn Lys Thr Ile
            500                 505                 510

Asn Pro Tyr Trp Pro Thr Phe Asp Ser Gly Lys Gln Leu Leu Phe Asn
        515                 520                 525

Thr Thr Thr Arg Asp Thr Leu Ser Pro Ala Asp Pro Arg Ile Val Glu
    530                 535                 540

Thr Ser Ser Leu Thr Asp Phe Gly Thr Ser Gln Lys Thr Lys Cys Asp
545                 550                 555                 560

Phe Trp Arg Gly Ser Ile Ser Val Asn Ala Gly Leu
                565                 570

<210> SEQ ID NO: 3
<211> LENGTH: 2220
<212> TYPE: DNA
<213> ORGANISM: Rhodosporidium toruloides
<400> SEQUENCE: 3

ggatccaccc gaactctgtc ccgctttctg gctttcttcc ttgctgtcgc cccatcgcct      60
ttcccgactc gccgccatgc tccttaacct cttcaccctc gcctccctcg ctgcgacgct     120
ccagctcgcc tttgcctctc cgacctccct cgtccgccgc acgaacccaa acgagccccc     180
tcccgtcgtc gacctcggct acgcccgcta ccaaggctac ttgaacgaga ccgccggact     240
ctactggtgg cgcggaatcc gctacgcctc ggctcagcgc ttccaggctc ctcagacgcc     300
cgcgacgcac aaggccgtcc gcaacgcgac tgagtatgga ccgatctgtt ggccggctag     360
cgagggaacc aacacgacca agggcttgcc gccgcctagc aacagctcga gcagcgcgcc     420
gcagaaacag gcgtcggagg attgcctctt cctcaatgtc gttgcccccg ccggctcgtg     480
cgagggcgac aatcttcccg tcctcgtcta cattcacgga ggtggctacg ccttcggcga     540
tgcgagcacc ggcagcgact ttgccgcctt caccaagcac acgggaacca agatggtcgt     600
tgtaaatctc cagtaccgtc tcggcagctt tggtttcctc gctggccaag ccatgaagga     660
ctacggtgta acgaacgccg gcttgcttga ccaggtgagt ttcccgcatg atacccgccc     720
acctttcgac tcatgctgac gcctctcccg ctcgcagcaa ttcgcccttc aatgggttca     780
acagcacgtc tcgaagttcg gcggcaaccc cgatcacgtt acgatttggg gcgagtctgc     840
aggcgcaggg tccgttatga accagatcat tgcgaacgtg agccacccga accgatctcc     900
agccgacttt cccccccccc ccccccccgc tgacctccct cgtcttgcag ggcggcaaca     960
ccgtcaaggc tctcggtctc aagaagcccc tcttccacgc tgccatcggc tcctccgtct    1020
tcctccccta ccaagccaag tacaactccc ccttcgccga gctgctctac tcccaactcg    1080
tctcggcgac aaactgcacc aaagccgcct cgtccttcgc ttgcctcgaa gctgtcgacg    1140
ctgcggcgct cgctgcggcg ggcgtgaaga actcggcggc gttcccgttc gggttttggt    1200
cgtatgtccc ggtcgtcgac gggaccttct tgactgagcg cgcgtcgctc cttctcgcca    1260
agggcaagaa gaacctcaat ggcgtgcgtg gcgagctttc gagtgcttca ggatctcgct    1320
gacactgtcg accggctcgc agaacctctt caccgggatc aacaacctcg acgaagatga    1380
gttcccgtcg acggctctgt tcgcccagcg agactgactt gttcttttgc gaagattacg    1440
attcatattc actgacgcca ctattcagaa cgacacgatc agcgaccagt cgcagcgcgt    1500
ctcccagttc gaccgcctcc tcgccggcct cttcccctac atcacctcgg aggagcgcca    1560
ggccgtcgcg aagcagtacc cgatctccga cgcgccgtca aagggcaaca ccttctctcg    1620
catctcggcc gtcatcgcgg actcgacctt cgtgtgcgtt ccccgtcgtc ttctccgagt    1680
attccgctga cttcccgctt gcccgcagct gcccgaccta ctggaccgcc gaggcgttcg    1740
gctcgtccgc ccacaagggc ctcttcgact acgcgccggc tcaccacgcg accgacaact    1800
cgtactacat cggctccatc tggaacggca agaagtcggt ctcgtccgtc cagtccttcg    1860
acggcgcgct cggcggcttc atcgagacgt tcaacccgaa caacaacgct gccaacaaga    1920
ccatcaaccc ttactggccg acgttcgact cgggcaagca gctcctcttc aacacgacga    1980
cgagggacac cctctctccc gccgacccgc gcatcgttga gacttcaagc ttgaccgact    2040
ttggcacgag ccagaagacc aagtgcgact tctggcgtgg gtcaatctcg gtgaacgcgg    2100
gtctctaggc gtctttcctt ccgacttcct tcgttctttc gttgtttatt cttgcagttc    2160
cgttgtatcg gccattcgtg cgtgtagctc actcgagtat agacgttggc aagtgcgaaa    2220


<210> SEQ ID NO: 4
<211> LENGTH: 544
<212> TYPE: PRT
<213> ORGANISM: Rhodosporidium toruloides
<400> SEQUENCE: 4

Thr Asn Pro Asn Glu Pro Pro Pro Val Val Asp Leu Gly Tyr Ala Arg
1               5                   10                  15

Tyr Gln Gly Tyr Leu Asn Glu Thr Ala Gly Leu Tyr Trp Trp Arg Gly
            20                  25                  30

Ile Arg Tyr Ala Ser Ala Gln Arg Phe Gln Ala Pro Gln Thr Pro Ala
        35                  40                  45

Thr His Lys Ala Val Arg Asn Ala Thr Glu Tyr Gly Pro Ile Cys Trp
    50                  55                  60

Pro Ala Ser Glu Gly Thr Asn Thr Thr Lys Gly Leu Pro Pro Pro Ser
65                  70                  75                  80

Asn Ser Ser Ser Ser Ala Pro Gln Lys Gln Ala Ser Glu Asp Cys Leu
                85                  90                  95

Phe Leu Asn Val Val Ala Pro Ala Gly Ser Cys Glu Gly Asp Asn Leu
            100                 105                 110

Pro Val Leu Val Tyr Ile His Gly Gly Gly Tyr Ala Phe Gly Asp Ala
        115                 120                 125

Ser Thr Gly Ser Asp Phe Ala Ala Phe Thr Lys His Thr Gly Thr Lys
    130                 135                 140

Met Val Val Val Asn Leu Gln Tyr Arg Leu Gly Ser Phe Gly Phe Leu
145                 150                 155                 160

Ala Gly Gln Ala Met Lys Asp Tyr Gly Val Thr Asn Ala Gly Leu Leu
                165                 170                 175

Asp Gln Gln Phe Ala Leu Gln Trp Val Gln Gln His Val Ser Lys Phe
            180                 185                 190

Gly Gly Asn Pro Asp His Val Thr Ile Trp Gly Glu Ser Ala Gly Ala
        195                 200                 205

Gly Ser Val Met Asn Gln Ile Ile Ala Asn Gly Gly Asn Thr Val Lys
    210                 215                 220

Ala Leu Gly Leu Lys Lys Pro Leu Phe His Ala Ala Ile Gly Ser Ser
225                 230                 235                 240

Val Phe Leu Pro Tyr Gln Ala Lys Tyr Asn Ser Pro Phe Ala Glu Leu
                245                 250                 255

Leu Tyr Ser Gln Leu Val Ser Ala Thr Asn Cys Thr Lys Ala Ala Ser
            260                 265                 270

Ser Phe Ala Cys Leu Glu Ala Val Asp Ala Ala Ala Leu Ala Ala Ala
        275                 280                 285

Gly Val Lys Asn Ser Ala Ala Phe Pro Phe Gly Phe Trp Ser Tyr Val
    290                 295                 300

Pro Val Val Asp Gly Thr Phe Leu Thr Glu Arg Ala Ser Leu Leu Leu
305                 310                 315                 320

Ala Lys Gly Lys Lys Asn Leu Asn Gly Asn Leu Phe Thr Gly Ile Asn
                325                 330                 335

Asn Leu Asp Glu Gly Phe Ile Phe Thr Asp Ala Thr Ile Gln Asn Asp
            340                 345                 350

Thr Ile Ser Asp Gln Ser Gln Arg Val Ser Gln Phe Asp Arg Leu Leu
        355                 360                 365

Ala Gly Leu Phe Pro Tyr Ile Thr Ser Glu Glu Arg Gln Ala Val Ala
    370                 375                 380

Lys Gln Tyr Pro Ile Ser Asp Ala Pro Ser Lys Gly Asn Thr Phe Ser
385                 390                 395                 400

Arg Ile Ser Ala Val Ile Ala Asp Ser Thr Phe Val Cys Pro Thr Tyr
                405                 410                 415

Trp Thr Ala Glu Ala Phe Gly Ser Ser Ala His Lys Gly Leu Phe Asp
            420                 425                 430

Tyr Ala Pro Ala His His Ala Thr Asp Asn Ser Tyr Tyr Ile Gly Ser
        435                 440                 445

Ile Trp Asn Gly Lys Lys Ser Val Ser Ser Val Gln Ser Phe Asp Gly
    450                 455                 460

Ala Leu Gly Gly Phe Ile Glu Thr Phe Asn Pro Asn Asn Asn Ala Ala
465                 470                 475                 480

Asn Lys Thr Ile Asn Pro Tyr Trp Pro Thr Phe Asp Ser Gly Lys Gln
                485                 490                 495

Leu Leu Phe Asn Thr Thr Thr Arg Asp Thr Leu Ser Pro Ala Asp Pro
            500                 505                 510

Arg Ile Val Glu Thr Ser Ser Leu Thr Asp Phe Gly Thr Ser Gln Lys
        515                 520                 525

Thr Lys Cys Asp Phe Trp His Gly Ser Ile Ser Val Asn Ala Gly Leu
    530                 535                 540



<210> SEQ ID NO: 5
<211> LENGTH: 15
<212> TYPE: PRT
<213> ORGANISM: Rhodosporidium toruloides
<400> SEQUENCE: 5

Thr Asn Pro Asn Glu Pro Pro Pro Val Val Asp Leu Gly Tyr Ala